
439 research outputs found

Using Property-Based Testing to Generate Feedback for C Programming Exercises

Author: Ribeiro Rita P.
Vasconcelos Pedro
Publication venue: OASIcs - OpenAccess Series in Informatics. First International Computer Programming Education Conference (ICPEC 2020)
Publication date: 01/01/2020

Dagstuhl Research Online Publication Server

SMOTE for regression

Author: Branco Paula
Pfahringer Bernhard
Ribeiro Rita P.
Torgo Luís
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013

Several real-world prediction problems involve forecasting rare values of a target variable. When this variable is nominal, we have a problem of class imbalance that has already been studied thoroughly within machine learning. For regression tasks, where the target variable is continuous, few works exist addressing this type of problem. Still, important application areas involve forecasting rare extreme values of a continuous target variable. This paper describes a contribution to this type of task. Namely, we propose to address such tasks with sampling approaches. These approaches change the distribution of the given training data set to reduce the imbalance between the rare target cases and the most frequent ones. We present a modification of the well-known SMOTE algorithm that allows its use on these regression tasks. In an extensive set of experiments, we provide empirical evidence for the superiority of our proposals for these particular regression tasks. The proposed SmoteR method can be used with any existing regression algorithm, turning it into a general tool for addressing problems of forecasting rare extreme values of a continuous target variable.

CiteSeerX

Research Commons@Waikato

A Benchmark dataset for predictive maintenance

Author: Gama João
Pereira Pedro M.
Ribeiro Rita P.
Veloso Bruno
Publication date: 18/07/2022

The paper describes the MetroPT data set, an outcome of an eXplainable Predictive Maintenance (XPM) project with an urban metro public transportation service in Porto, Portugal. The data, collected in 2022, is intended for evaluating machine learning methods for online anomaly detection and failure prediction. By capturing several analogue sensor signals (pressure, temperature, current consumption), digital signals (control signals, discrete signals), and GPS information (latitude, longitude, and speed), we provide a dataset that can be easily used to evaluate online machine learning methods. This dataset contains some interesting characteristics and can be a good benchmark for predictive maintenance models.

arXiv.org e-Print Archive

Are the States United? An analysis of US hotels’ offers through TripAdvisor’s eyes

Author: Batista F.
Moro S.
Oliveira C.
Ribeiro R.
Rita P.
Publication venue: 'SAGE Publications'
Publication date: 01/01/2019

This empirical data-driven research aims to unveil thought-provoking insights on the U.S. hotel offer across its 50 states. Information on more than 30,000 hotels was collected through web scraping from TripAdvisor. Using such data, 50 support vector machine models were trained to model the TripAdvisor score, one per state, to assess the convergent and divergent factors in customer satisfaction across all the U.S. states. A conceptual model is proposed and validated through the data-driven support vector machine models developed for each state to identify convergent features across the states to explain customer satisfaction (here represented by TripAdvisor score). Hotel size, price, and stars are not moderated by the location, expressed by the corresponding state, although these highly influence satisfaction, whereas both hotel number of published photos and the amenities are affected by the location. Thus, adaptation issues were found regarding amenities and published photos within each state’s offer.

Repositório Institucional do ISCTE-IUL

Repositório da Universidade Nova de Lisboa

Leveraging national tourist offices through data analytics

Author: Batista F.
Moro S.
Oliveira C.
Ribeiro R.
Rita P.
Publication venue: 'Emerald'
Publication date: 01/01/2018

Purpose: This study aims to propose a data-driven approach, based on open-source tools, that makes it possible to understand customer satisfaction of the accommodation offer of a whole country. Design/methodology/approach: The method starts by extracting information from all hotels of Portugal available at TripAdvisor through Web scraping. Then, a support vector machine is adopted for modeling the TripAdvisor score, which is considered a proxy of customer satisfaction. Finally, knowledge extraction from the model is achieved using sensitivity analysis to unveil the influence of features on the score. Findings: The model of the TripAdvisor score achieved a mean absolute percentage error of around 5 per cent, proving the value of modeling the extracted data. The number of rooms of the unit and the minimum price are the two most relevant features, showing that customers appreciate smaller and more expensive units, whereas the location of the hotel does not hold significant relevance. Originality/value: National tourist offices can use the proposed approach to understand what drives tourists’ satisfaction, helping to shape a country’s strategy. For example, licensing new hotels may take into account the unit size and other characteristics that make it more attractive to tourists. Furthermore, the procedure can be replicated at any time and in any country, making it a valuable tool for data-driven decision support on a national scale.

Repositório Institucional do ISCTE-IUL

Modelos de previsão de valores extremos e raros [Models for predicting extreme and rare values]

Author: Luis Torgo
Rita P. Ribeiro
Publication date: 01/01/2010

Repositório Aberto da Universidade do Porto

Evaluation of non-conventional geothermal potential in a volcanic island

Author: Caldeira Rita
Ribeiro Maria Luísa
Rosa Carlos J. P.
Rosa Diogo R. N.
Publication date: 01/06/2012

Repositório do LNEG

COVID-19: The European Institute of Oncology as a "hub" centre for breast cancer surgery during the pandemic in Milan (Lombardy region, northern Italy) - A screenshot of the first month

Author: A. Rita Vento
E. Vicini
P. Naninato
P. Veronesi
S. Kahler Ribeiro Fontana
V. Galimberti
Publication venue: 'Elsevier BV'
Publication date: 01/06/2020

AIR Universita degli studi di Milano

Characterization of two-year progression of neurodegeneration in different risk phenotypes of diabetic retinopathy

Author: Barreto Patrícia
Coimbra Rita
Cunha-Vaz José
Lobo Conceição
Madeira Maria H.
Marques Inês P.
Ribeiro Luísa
Santos Ana Rita
Santos Torcato
Publication venue: 'SAGE Publications'
Publication date: 01/01/2022

To characterize the two-year progression of neurodegeneration in different diabetic retinopathy (DR) risk phenotypes in type 2 diabetes.

Repositório Científico do Instituto Politécnico do Porto

PubMed Central

Estudo Geral